Language corpora

نویسندگان

چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Language-specific encoding in endangered language corpora

The paper addresses problems of corpus building and retrieval resulting from codeswitching, which is a characteristic feature of endangered language recordings. The typical appearance of code-switching phenomena is first outlined on the basis of data collected in the DoBeS ‘ECLinG’ project, which dealt with three endangered Caucasian languages spoken in Georgia: Tsova-Tush (Batsbi), Udi, and Sv...

متن کامل

Sign Language Recognition: Working with Limited Corpora

The availability of video format sign language corpora limited. This leads to a desire for techniques which do not rely on large, fully-labelled datasets. This paper covers various methods for learning sign either from small data sets or from those without ground truth labels. To avoid non-trivial tracking issues; sign detection is investigated using volumetric spatio-temporal features. Followi...

متن کامل

Word clustering with parallel spoken language corpora

In this paper we introduce a word clustering algorithm which uses a bilingual, parallel corpus to group together words in the source and target language. Our method generalizes previous mutual information clustering algorithms for monolingual data by incorporating a statistical translation model. Preliminary experiments have shown that the algorithm can e ectively employ the constraints implici...

متن کامل

Advanced Distribution Means for Spoken Language Corpora

This report outlines the distribution of Spoken Language Corpora on traditional CD-ROM media and a new approach via network. High capacity CD-ROMs are being introduced, but this is only a marginal improvement in respect to the distribution of SLC. Network access however offers many opportunities: customized SLC, on-line access, and a high degree of protection. However, for network access to be ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Jezikoslovni zapiski

سال: 2015

ISSN: 1581-1255,0354-0448

DOI: 10.3986/jz.v9i1.2604